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Abstract. The tree metric theorem provides a combinatorial four point 
condition that characterizes dissimilarity maps derived from pairwise 
compatible split systems. A related weaker four point condition char- 
acterizes dissimilarity maps derived from circular split systems known 
as Kalmanson metrics. The tree metric theorem was first discovered in 
the context of phylogenetics and forms the basis of many tree recon- 
struction algorithms, whereas Kalmanson metrics were first considered 
by computer scientists, and are notable in that they are a non-trivial 
class of metrics for which the traveling salesman problem is tractable. 

We present a unifying framework for these theorems based on com- 
binatorial structures that are used for graph planarity testing. These are 
(projective) PC-trees, and their affine analogs, PQ-trees. In the projec- 
tive case, we generalize a number of concepts from clustering theory, 
including hierarchies, pyramids, ultramctrics and Robinsonian matri- 
ces, and the theorems that relate them. As with tree metrics and ultra- 
metrics, the link between PC-trees and PQ-trees is established via the 
Gromov product. 

Keywords, hierarchy, Gromov product, Kalmanson metric, Robinsonian 
metric, PC tree, PQ tree, phylogenetics, pyramid, ultrametric. 



1. Introduction 

In his "Notebook B: Transmutation of Species" (1837), Charles Darwin drew 
a single figure to illustrate the shared ancestry of extant species (Figure 1). 
That figure is a pictorial depiction of a graph known as a rooted X-tree. 

Definition 1. An X-tree T = (T, 0) is a pair where T = (V, E) is a tree and 
: X — >• y is a bijection from X to the leaves of T. Two X-trees 7i = (Ti, 0i), 
72 = (72, 4'2) are isomorphic if there exists a graph isomorphism $ : Ti — )• T2 
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Figure 1. A page from Darwin's Notebook B on the Trans- 
mutation of Species showing a rooted X-trec (the root is 
labeled with "1") where the set X = {A, B, C, D}. 



such that (/)2 = ^ o <j>i. A rooted X-tree is an X-tree with a distinguished 
vertex r. 

The biological interpretation of rooted X-trees lies in the identification 
of the leaves with extant species, and the vertices along a path from a leaf to 
the root as ancestral species. Although the validity of trees in describing the 
ancestry of species has been debated [TS], trees are now used to describe the 
shared ancestry of individual nucleotides in genomes and they are perfectly 
suited for that purpose |S]. 

In phylogenetics, it is desirable to associate lengths with the edges of 
trees. Such lengths may correspond to time (in years), or to the number of 
mutations (usually an estimate based on a statistical model). This leads to 
the notion of a tree metric, that is conveniently understood via the notion 
of a weighted split system. Here and in what follows, for simplicity we often 
take X = {1,2, . . .,n}. 

Definition 2. A split S* of X is a partition A\B of X into two nonempty 
subsets. The corresponding split pseudometric is given by 

n r .^ fi if |{»,j}nA| = i, 

I otherwise. 

A split is trivial if|yl| = lor|i3| = l. A split system is a set of splits 
containing all the trivial splits. 
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Removing an edge of an X-tree disconnects the tree into two pieces and 
thus gives a spht Se of X. We can associate a spht system {Se\e E E{T)} to 
T by doing this for each edge. 

Definition 3. A dissimilarity map is a function D : X x X ^ R.>o such 
that D{i,i) ~ and D{i,j) — D{j,i) Vi,j G X. A dissimilarity map D is 
S-additive if there is a split system S such that 

D=Y, w{S)Ds 
ses 

for some non- negative weighting function w : S ^ R>o- If S is the split 
system associated to an AT-tree T, we say D is T-additive. D is a tree metric 
if it is T-additive for some A-tree. 

The following classic theorem precisely characterizes metrics that come 
from trees: 

Theorem 1. A dissimilarity map D is a tree metric if and only if for each 
i,j,k,l G A, the value 

max{i?(z, j) + D{k, I), D[i, k) + D{j, l),D{i, I) + D{j, k)} (1) 

is realized by at least two of the three terms. 

Equation [T] is called the four-point condition. Theorem [T] motivates the 
development of algorithms for identifying tree metrics that closely approxi- 
mate dissimilarity maps obtained from biological data. In molecular evolu- 
tion, dissimilarity maps are derived by determining distances between DNA 
sequences according to probabilistic models of evolution; more generally, al- 
gorithms that approximate dissimilarity maps by tree metrics can be applied 
to any distance matrices derived from data. However, dissimilarity maps de- 
rived from data are never exact tree metrics due to two reasons: first, as 
discussed above, some evolutionary mechanisms may not be realizable on 
trees; second, even in cases where evolution is described well by a tree, fi- 
nite sample sizes and random processes underlying evolution lead to small 
deviations from "treeness" in the data. 

It is therefore desirable to generalize the notion of a tree, and a natural 
approach is to consider adding splits to those associated with an A'-tree. One 
natural class of split systems to consider is the following. 

Definition 4. A circular ordering C ~ {xi, . . . , a;„} is a bijection between A 
and the vertices of a convex n-gon P„ such that Xi and Xi+i map to adjacent 
vertices of P„ (where a:„+i :— xi). Let Sij denote the split 
{xi,Xi+i, . . . ,Xj_i}\{xj,Xj+i, . . ■,Xi_i} and let S{C) = {S^^jli < j}. We say 
a split S is circular with respect to a circular ordering C if S* G S{C), and a 
split system S is circular if 5 C S{C) for some circular ordering C. 

Given a set E of circular orderings, let S{£) — ncG£5(C) be the sys- 
tem of splits that are circular with respect to each ordering in £. The split 
system associated with a binary A-tree arises in this way from a family £ 
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of 2"^^ circular orderings [55], and a tree metric is obtained by associating 
non-negative weights to each spht in the system. Kalmanson metrics, which 
were first introduced in the study of travehng salesmen problems where they 
provide a class of metrics for which the optimal tour can be identified in 
polynomial time |21j . correspond to the case when \£\ = 1. 

Kalmanson metrics can be visualized using split networks '19]. We do 
not provide a definition in this paper, but show an example in Figure 2 
(drawn using the software SplitsTree4) . The neighbor- net |3] and MC-net [13] 
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Figure 2. A Kalmanson metric (left) visualized as a split 
network (right). 



algorithms provide a way to construct circular split systems from dissimilarity 
maps, but despite having a number of useful properties [U I22[ . have not 
been widely adopted in the phylogenetics community. This is likely because 
split networks (such as in Figure 2) fail to reveal the "treeness" of the data. 
More specifically, the internal nodes of the split network do not correspond to 
meaningful "ancestors" as do the internal nodes in X-trees. Other approaches 
to visualizing "treeness", e.g. [5S], do reveal the extent of signal conflicting 
with a tree in data, but do not reveal in detail the splits underlying the 
discordance. 

We propose that PQ- and PC-trees, first developed in the context of 
the consecutive ones problem [T^ and for graph planarity testing [371 HI] j 
are convenient structures that interpolate between X-trees and full circular 
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split systems. The main resuh of this paper is Theorem |3] Given a Kalman- 
son metric, the theorem shows how to construct a "best-fit" PC-trec which 
reaUzes the metric and captures the "treeness" of it. 

Another (expository) goal of this paper is to organize existing results 
on PC-trees, their cousins PQ-trees, and corresponding metrics (Theorem [4] 
in Section [5]). We believe this is the first paper to present the various results 
as part of a single unified framework. As a prelude, we illustrate the types of 
results we derive using a classic theorem relating rooted X-trees to special 
set systems that encode information about shared ancestry. 

Definition 5. A hierarchy % over a set X is a collection of subsets of X such 
that 

1. X eH, and {x} e H for all x e X 

2. AnB A, B} for all A,B eU. 

The requirement that each {x} e H is not part of the usual definition 
of hierarchy but its inclusion here will simplify later results. We then have 
the following: 

Proposition 1. There is a natural bijection between hierarchies over X and 
rooted X-trees. 

Proposition [TJ which we will prove constructively in Section 2, is an ele- 
mentary but classic result and has been discovered repeatedly in a variety of 
contexts |12U16j . For example, in computer science, hierarchies are known as 
laminar families where they play an important role in the development of re- 
cursive algorithms represented by rooted trees (see, e.g. [H]). Hierarchies are 
also important because they are the combinatorial structures that underlie 
ultrametrics. 

Definition 6. An ultrametric is a symmetric function D : X x X ^ such 
that 

D{x, y) < max{_D(a;, 2), D{y, z)} Vx, y,z E X 

Definition 7. An indexed hierarchy is a hierarchy T-L with a non-negative 
function f : H R>o such that for all A, B e A C B ^ f{A) < f{B). 

The extension of Proposition [T] to metrics, proved in [5D], shows that 
these objects are the same. The proposition is an instance of a tree metric 
theorem that associates a class of combinatorial objects (in this case rooted 
X-trees) with a class of metrics (in this case ultrametrics). Our results or- 
ganize other tree metric theorems that have been discovered (in some cases 
independently and multiple times) in the contexts of biology, mathematics 
and computer science. 

In particular, we investigate relaxations of Definitions [l] |3] and [s] for 
which there exist analogies of Theorem [T] and Proposition [T] For example, 
hierarchies are special cases of pyramids [9j , which can be indexed to produce 



strong Robinsonian matrices |24| . Proposition 10 (originally proved in [5]) 
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states that these objects correspond to each other mimicking the correspon- 
dence between hierarchies and ultrametrics. 

In discussing tree metric theorems we adopt the nomenclature of An- 
dreas Dress who distinguishes two types of objects and theorems: the affine 
and the projective [TU] . Roughly speaking, these correspond to "rooted" and 
"unrooted" statements respectively, and we use these terms interchangeably. 
For example, a hierarchy is an affine concept whose projective analog is a pair- 
wise compatible split system. Similarly, unrooted X-trees are the projective 
equivalents of rooted X-trees, and tree metrics are the projective equiva- 
lents of ultrametrics. We'll see that Kalmanson metrics are to tree metrics 
as Robinsonian matrices [23] are to ultrametrics, and circular split systems 
are to pairwise compatible split systems as pyramids are to hierarchies. We 
use PQ-trees [2] and their projective analogs PC-trees [22] to link all of these 
results. 



2. Hierarchies, X-trees and split systems 

We begin by proving Proposition [l] both for completeness and to introduce 
some of the notation that we use. An AT-trec T = (V, E, (f>) has a natural 
partial ordering on its vertices: for distinct u,v Cz V, we say u ^ w if w lies on 
the unique path from u to the root. Given v £ V, let Hy = {x G X\x =4 
and a{T) = {Hy\v G V}. 

Proposition 2. The map a is a bijection from rooted X-trees to hierarchies 
over X . 

Proof. Let (T, 0) be a rooted X-tree with root r. Hr = X and = 
{a;} for all x £ X,so'H satisfies (1) of Definitionjs] Consider any two Hy € 
a(T). Ifu ^ V then HuHHy — iiv ^ u then H^nHy = Hy, and otherwise 
Hu n Hy = 0. So each pair of elements in a{T) satisfies (2) of Definition [5] 
and a{T) is a hierarchy. 

For the reverse direction, let H be a hierarchy over X. Let T = (V, E) 
be the digraph with V = {va\A £ "H} and with edges denoting minimal 
inclusion: T has an edge from vg to w^i iff yl C i? and there does not exist 
C e H such that A C C C B. We will show that T is a tree. First note 
that by induction on |C| each vertex vc with C ^ A is connected to vx and 
has at least one parent. Now suppose va, vb are distinct parents of vc- Then 
^ n B 7^ 0, so without loss of generality by the hierarchy condition A C B. 
But then C C A C B, a contradiction. Thus T is connected and has one fewer 
edge than vertices, so T is a tree with root vx- Define the map cj) : X ^ V 
by = "f^x- This is a bijection from X to the leaves, so T = {T,4') is a 
rooted A-tree with a{T) = H. □ 

The above proposition gives a characterization of rooted A-trees in 
terms of collections of subsets of A. We turn now to the projective analogue 
of rooted A-trees. 
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Definition 8. Two X-splits A2\B2 are compatible if one of AinA2, AiD 

B2, Bi n A2, Bi D B2 is empty, and are incompatible otherwise. 

Removing an edge e from a projective X-tree disconnects the tree into 
two pieces and thus gives an X-spht Se- Let /3(T) = {Se\e € E{T)}. It is easy 
to check that (3{T) is a pairwise compatible spht system, and a basic theorem 
of X-trees [5] [23] shows every pairwise compatible split system arises in this 
way, giving 

Proposition 3. /3 is a bijection from the set of projective X -trees to the set of 
pairwise compatible split systems over X . 

Finally we show that pairwise compatible split systems and hierarchies 
are in bijection. Fix r € X and let 5 be a set of pairwise compatible splits 
of X. The unrooting map 7^ sends a split S to the component of S that 
does not contain r. The rooting map dr sends a set A C X \ {r} to the split 
dr{A) = X\X \ A. If 5 is a split system, let 7^(5) = {-fr{S)\S e S}. 

Proposition 4. S is a pairwise compatible split system over X if and only if 
7r(5) is a hierarchy over X \ {r}. 

Proof. Choose 51,5*2 G S, with Si = Ai\Bi. If 5i,52 are compatible, then 
without loss of generality Ai n A2 = 0. If r G Bi,B2, then 7^(5^) = Ai, so 
7r(5i) n 7r(52) = 0. If r e Ai, then r ^ A2 since ^1 n = 0, and therefore 
r e i?2. In this case 7r(5i) ~ X \ Ai and 7r(52) = A2, so 7r(5i) n 7r(52) = 
A2 = 7r(52), again satisfying the hierarchy condition. The final case follows 
by symmetry. Conversely, suppose 7r('5) is a hierarchy. Then for any two 
81,82 e 5 we have 7r(5i) n7r-(52) S {0, 7,-(5i), 7^(52)}. Checking the cases 
as above shows that 5i and 52 must then be compatible splits. □ 

We define the map to take an afhnc X \ {r}-tree T to the projective 
X-tree obtained by attaching a vertex with label r to the root of T. The 
inverse map takes a projective X-tree T to the affine X-tree as follows: 
let V be the vertex of T labelled r. Then Xr{T) is obtained by rooting at the 
neighbor of v, and then deleting v. 

Proposition 5. If AT and H are the sets of all affine trees and hierarchies 
over X \ {r}, respectively, and PT and PSS are the sets of all projective trees 
and pairwise compatible splits systems over X, respectively, then the following 
diagram commutes: 



AT 



PT 



a 



H 



PS8 



Each arrow is a bijection; the unlabelled arrows are the inverses of the maps 
going in the other direction. 
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3. PQ-trees 

We start our generalization of Proposition [5] with a generalization of rooted 
X-trees. 

Definition 9. A PQ-tree over X is a rooted X-tree in which every vertex 
comes equipped with a linear ordering on its children. Every internal vertex 
of degree three or less is labeled a P-vertex, and every internal vertex of 
degree four or more is labeled either as a P-vertex or a Q-vertex. We say two 
PQ-trees 71,72 are equivalent (we write 7i ^ T2) if one can be obtained from 
the other by a series of moves consisting of: 

1. Permuting the ordering on the children of a P-vertex, 

2. Reversing the ordering on the children of a Q-vertex. 

We draw a PQ-tree by representing P-vertices as circles and Q-vertices 
as squares, and ordering the children of a vertex from left to right as per the 
corresponding linear order (see Figure 3). For any PQ-tree T over A", define 




BFE DGAC AGDC BGF 

Figure 3. Four PQ trees. Ti,T2 and T3 are equivalent to 
each other but T4 is different from the other three. 

the frontier of the tree as the linear ordering on X derived from reading the 
leaves of T from left to right. Let con{T) = {frontier{T')\T' ^ T} be the 
set of all linear orderings -< that are consistent with the PQ structure of T. 
We say I is an interval with respect to -< if there exist a,b d X such that 
I = {t\a ^ t ^ b}. Define a to be the map that sends the PQ-tree T to 
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the set of all / C X such that / is an interval with respect to every linear 
ordering in con{A). 

Lemma 1. a(T) is a hierarchy if and only if every vertex of T is a P-vertex. 
If so, let T' be the corresponding normal affine X-tree. Then a{T) = a{T'), 
where oi{T') is the hierarchy constructed in Proposition^ 

Proof. Let v be an internal vertex of T, {ci, C2, . . . , c„} the set of its children, 
and recall — {x £ X\vx ^ v} is the set of all the elements x such that the 
path from Vx to the root includes v. Now Hc^ U • • • U Hc^ is in a(T), and if 
every vertex of A is a P-vertex then every element of a(T) will be of this form. 
In this case a{T) is identical to the hierarchy constructed in Proposition [2j 
Now suppose T has a Q- vertex v. Then n > 3 and both A — U and 
B = U -ffc3 are in a(T), but An B = H^^ is nonempty, so a(T) is not a 
hierarchy. □ 

This shows that the map a on PQ-trees agrees with the a in Proposition 
[5j It also shows that the usual affine X-trees are precisely PQ-trees with all 
P-vertices. Since PQ-trees do not necessarily give rise to hierarchies, we seek 
a different combinatorial characterization of them. 

Definition 10. A collection of subsets P of A is a prepyramid if 

1. X eV and {x} G V for aU x G A, 

2. There exists a linear ordering ^ on A such that every A e P is an 
interval with respect to ^. 

■p is a pyramid if, in addition, it is closed under intersection. 

If T is a PQ-tree then a{T) is a prepyramid with respect to any 
f rentier [T), and a(T) is the associated prepyramid. 

Definition 11. Two subsets A,B of A are compatible if A n i? G {0,A, B}. 
Otherwise they are incompatible A rooted family over A' is a collection of sets 
J such that if A, _B e J" are incompatible, then Ar\B, A\B,B\A, and A\JB 
are in J- . 

We are now ready to state the main result of this section. 

Proposition 6. The map a is a bijection from PQ-trees to prepyramids that 
are rooted families. 

Proof. If T is a PQ-tree, a(T) is obviously a prepyramid. We now show it is 
also a rooted family. Trivially A e a(T) and {x} G a{T) for all x g A. Let 
A, B be incompatible sets in J- and con{A) a linear ordering. We can write 
A = {t\xA ditdi Va] and B = {t\xB :< t ^ ys} for some xa,xb,Va,Vb e A, 
and by incompatibility we can assume xa ^ xb di Va ^ Ub- Then An B = 
{t\xB di t d va}, AUB = {t\xA d t d ys}, A\B ^ {t\xA d t -< xb} and 
B \ A — {t\yA ^ t d Vb}- Since each of these four sets is an interval with 
respect to -< for every con(A) they are each in a{T). 

It remains to show that if is a collection of subsets of A which is a 
prepyramid and a rooted family, then there exists a unique PQ-tree T such 
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that a(T) = J- . Let F' Q F consist of the sets in F that are compatible 
with all of F. Then the elements of F' are pairwise compatible, so F' is a 
hierarchy and corresponds to a tree T', as in Proposition [2j Consider the 
vertices vc in T' corresponding to subsets of the form C — A U B with 
A,BgF incompatible, and mark those vertices as Q-vertices. For each such 
Vc there is a natural ordering on its children vc\,vc2, ■ ■ ■ ,'>^c„ as follows: 
vci < if the labels of the leaves in the subtree rooted at Vd are all ~< the 
labels of the leaves of the subtree rooted at vcj , where -< is the order from 
the rooted family condition. Ordering the children of the Q-vertices of T' in 
this way, we obtain a PQ-tree T. 

We will use induction on to show a{T) = F and that T is the only 
such PQ-tree for which this is true. First, suppose F' contains a set ^ 7^ X, 
\A\ > 1. Define Fi = {C e F\A C C or ACi C = Hi} and F2 ^ {C e F\A D 
C} . Fi O F2 — A, and because A is compatible with everything F1UF2 = F. 
Fi is a prepyramid over {X \ A) U {A}and F2 is a prepyramid over A, and 
both are rooted families with \Fi\ < \F\. Then the inductive hypothesis shows 
there are PQ-trees 71,72 such that a(7i) = Fi for i = 1,2. Now 71 has a 
leaf corresponding to A, and the root of T2 also corresponds to A. Grafting 
T2 onto the leaf A in 71 gives a PQ-tree T with a{T) — a(71) U a(72) — F. 
Conversely, if 7~ is a PQ-tree with a{T) = F, T must have a node v such 
that the subtree 71 with root v satisfies a (71) — F\^ and the tree T2 obtained 
by replacing the subtree 71 with the vertex v and label A gives a (72) = F2- 
The inductive hypothesis gives the uniqueness of 71 and 72, which in turn 
implies the uniqueness of T. 

Now suppose no such set A exists, so F' — {{a;i}, . . . , {a;„}, X}. If 
F ^ F' then is a hierarchy and the PQ-tree of depth one with root a 
P-vertex is the unique tree such that a{T) = F. We now consider the final 
case, where F contains sets other than X and the {x}s, and every element 
in F except for these is incompatible with another element of F . Without 
loss of generality assume xi -< X2 -< ■ ■ ■ ^ Xn, where -< is the ordering 
under which is a prepyramid. Then every element of F is of the form 
X[i:j] := {xi,Xi+i, • • • , Xj} for 1 < i < j < n. Let T be the PQ-tree of depth 
1 with a Q-vertex root and children Xi. Since a{T) — {x[i.j]\l < i < j < n} 
it's clear a{T) 5 F. 

We must show F = aiT'), or equivalently, that x^i-j] G F for all i < j. 
We will prove this by induction on n. It's obvious for n = 3, so suppose 
n > 4. Let A ^ X he a set in F that is maximal under inclusion. By 
assumption |A| > 1 and A is incompatible with some B ^ F. By the rooted 
family condition AU B E F so we must have AU B ~ X hy the maximality 
of A. Without loss of generality we may write A = B = x^i^n] with 

i < j. We claim j — n — 1. For by the rooted family condition F contains 
A \ B — An B ~ and B \ A ~ a;[j+i:n]- If j ^ n — 1, B \ A is 

incompatible with some other set C E F. C must be of the form C — x^^^.k^j 
with ki < j + 1 < k2 < n. If ki — 1 then C Z) A contradicting the maximality 
of A. Then ki ^ 1, so A and C are incompatible and AU C ^ x^i.^^] E F, 
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contradicting the maximality of A. Thus j ~ n — 1 and X[i.„_i] G J-. By the 
same reasoning, X[2:n] G J'- 

Now let G = {A g T\A C X[i.n-i]}- Since X[i:„_i] G G,G t-S aprepyramid 
over X \ {xn}- It is also a rooted family. We claim that every element A £ G 
with A 7^ {xi},v4 X \ {xn} is incompatible with some other element of 
G- To see this, write A = x^i-j], j < n — 1 and suppose j ^ n — 1. By 
our initial assumption A is incompatible with some set B — x^^i.j,]^ e J-. If 
/ ^ n then B G G gives the incompatible set. Otherwise B and X[i;n-i] 
are incompatible, so B n X[i.n_^ = a;[i':„_i] is in G and is incompatible with 
A. Finally, suppose j = n — 1. Then A is incompatible with a;[„_i.„], so 
A U a;[„_i:„] = a:^[i:ri] G -7^- This must be incompatible with some other set 
B G J^. We can write B — x^ii.ji], i < / < n — 1, so S G ^ is incompatible 
with A and the claim is proved. 

It follows that C/ is a rooted prepyramid over X\{xn}, and every element 
in G other than X and the {xi}s is incompatible with some other element 
of G- By our inductive hypothesis, £ G for each l<i<j<n — 1. 

We showed that X[„_i.„] G J^, and for each i the sets a;[.i.„_i] and a;[„_i.„] 
are incompatible, so their union a;[i.„] is in Thus = {x[^.j]\l < i < j < 
n}. ' ' □ 



4. PC-trees 

We now describe the projective equivalent of PQ-trees. 

Definition 12. A PC-tree over X is an X-tree where each internal vertex 
comes equipped with a circular ordering of its neighbors. Additionally, each 
internal vertex of degree less than 4 is labelled a P-vertex, and each other 
vertex is labelled either a P-vertex or a C-vertex (Figure 4). Two PC-trees 
7i,72 are said to be equivalent (we write 7i ~ Ti) if one can be obtained 
from the other by a series of the following moves: 

1. Permuting the circular ordering of the neighbors of a P-vertex, 

2. Reversing the circular ordering on the neighbors of a C-vertex. 

For a PC-tree T, let frontier{T) be the circular ordering given by reading 
the taxa in either a clockwise or counterclockwise direction. Let con{T) — 
{fronUer{V)\V ^ T} and let f3{T) = ncgcon(r)'5(C). We say l3{T) is the 
circular split system associated to T- 

Definition 13. A split system S is an unrooted split family over X if, for 
each pair of incompatible splits Si = Ai\Bi,S2 = ^21^2 in S, the splits 
Aid A2\Bi(J B2, AiCi B2\A2(J Bi, A2r\ Bi\AiU B2, and BinB2|^iU^2 are 
all in S as well. 

Following the proof in Proposition [6] that a{T) is a rooted family for 
every PQ-tree T, one can show /3(T) is an unrooted family for every PC-tree 

r. 



Lemma 2. For any PC-tree T , l3(T) is an unrooted split family. 
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Figure 4. Four PC-trees. Ti,T2 and T3 are equivalent to 
each other but Tj^ is different from the other three. 

We now generahze the map 7^ to PQ- and PC-trees. 

Definition 14. The unrooting map sends a PQ-tree T over X \ {r} to the 
PC-tree K,r{T) as foUows: attach a vertex labelled r to the root of T. If vertex 
V in T has children {vi, V2, - ■ ■ ,Vk} with linear ordering vi -< V2 -< ■ ■ ■ ~< Vk 
and parent w, in the vertex has the same neighbors with circular 

ordering {vi, . . . ,Vk,w}. 

The rooting map A^. sends a PC-tree T over X to the PQ-tree over X\{r} 
obtained by rooting at the vertex adjacent to r, deleting r, and replacing each 
C vertex with a Q vertex. Let v be such a vertex with a circular ordering 
C = {vi, . . . ,Vm}', we may assume the path from v to the root passes through 
Vm- Then in Xr{T), vertex v has parent Vm and children Vi,. . . ,Vm-i with 
linear ordering vi ^ V2 ^ ■ ■ ■ ^ 

Since Hr and are inverses, this immediately gives the following: 

Proposition 7. Kr is a bijection from PQ-trees over X \ {r} to PC-trees over 
X. 
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Recall that if /S is a split of X, the map 7^. takes S to the component 
of S not containing r. 

Proposition 8. Let CUF he the set of circular split systems that are unrooted 
families over X, and let PP he the set of prepyramids that are rooted families 
over X \ {r}. Then the map 7^. is a bijection from CUF to PP. 

Proof. Sr and jr are inverses, so it suffices to show that ^r{CUF) C PRF 
and Sr{PRF) C CUF. Let 5 be a circular unrooted split family with circular 
ordering {xi, . . . ,Xn} and suppose r = Xi. Then 7r(»5) is a prepyramid with 
respect to the linear ordering on X \ {r} given by -< Xi^2 -<■■■-< Xi-i. 

Next, consider S E CUF and two incompatible sets G,H E 7r('5), 
with G = 7r(5'i) and H = jr{S2)- Assume 5*1 = Ai\Bi, S2 = are 
compatible as split systems with Ai (1 A2 = 9. If r G Ai, then r G B2 and 
G O H = {X — Ai) O A2 = A2 = H, contradicting the incompatibility of G 
and H. The other cases produce similar contradictions, so and S2 must be 
incompatible as split systems. Then the splits Ai (1 A2\Bi[J B2, Ai (1 B2\A2D 
Bi,A2nBi\AiUB2 and Bi ni?2|-4i U A2 are all in S. Assume without loss of 
generality that G = Ai,H = A2. Then -fr{{Ai n A2, Bi U B2}) = Aif\A2 = 
GnH G-fr{S), and similarly G n H,G\ H, H \G axe all in 7^(5), so 7^.(5) 
is a rooted family. 

Conversely, let be a pyramidal, rooted family over X \ {r} with linear 
ordering -<. 5r{J^) is circular and contains all the trivial splits, as {a;}|X \ 
{.t} = 7r({2^}) and \ {r} = 7r(A' \ {r}). The above argument reverses 

to show 6r{G), dr{H) G 6r{T) are incompatible as split systems only if G and 
H are incompatible as sets. In this case 6r{G (1 H) , Sr{G U H) , 6r{G \ H) and 
5r{H\G) are in 5r{T) so 5r{F) is an unrooted family. Thus, 5j.{PRF) = CUF 
and 7r(CJ7F) = Pi?F as required. □ 

The above propositions combine to show that the map Kr o o 7^ 
is a bijection from circular split systems to PC-trees, and that in fact for a 

PC-tree T, Kr o cT^ o 7r(T) is precisely the split system arising from T in 
the natural way. An example is shown in Figure 5. We thus have; 

Proposition 9. The following diagram commutes: 

PQ ^ PC 













a 














PRF < — ^ CUF 



where PQ is the set of PQ-trees over X\ {r} , PRF is the set of prepyramids 
that are rooted families over X\{r}, PC is the set of PC-trees overX, and 
CUF is the set of circular unrooted families over X. 
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5. Metrics Realized By PC-trees 

In this section we extend the tree metric theorem (Theorem [T]) by showing 
how to replace affine and projective trees with PQ- and PC-trees. To explain 
our approach, we recall that in Section 2 we reviewed the connection between 
trees and their representations as hierarchies and split systems: 



AT'. 



H . 



PT 

PSS 
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This correspondence can be extended to metrics as foUows pS] : 



AT 



PT 



H 



PSS 



TM 



Here U are uhrametrics and TM are tree metrics. The tree metric theorem is 
proved by diagram chasing: starting with a tree metric, the Gromov product 
is apphed (see Definition |16| below) , resulting in an ultrametric. A (unique) 
hierarchy representing the ultrametric can be obtained and then the PSS 
corresponding to the hierarchy can be derived by the unrooting map dr. This 
weighted PSS represents the tree metric. 

In this section we construct a PC-tree that best realizes a Kalmanson 
metric by a similar approach, constructing an analog of the above diagram 
with suitable PQ- and PC-tree counterparts (Theorem |4]). The extension 
requires some care, because the weighted PC-trees representing a Kalmanson 
metric may require extra zero splits. A key result (Theorem [s]) is that there 
is a unique PC-tree that minimally represents any Kalmanson metric. 

We begin by making precise the notion of a Kalmanson metric. 

Definition 15. A dissimilarity map D is Kalmanson if there is a circular 
ordering {xi, . . . , a;„} such that for for alH < j < fc < ^, 

ma,x{D{xt,Xj) + D{xk,xi),D{xux,) + D{xj,Xk)} < D{xi, Xk) + D{x.j,xi). 



Let T be a projective X-tree and C a circular ordering obtained by 
reading the taxa of T clockwise. If D is T-additive then D is Kalmanson 
with respect to C. Additionally, in this case one actually has equality in 
([2| for each i < j < k < I. Kalmanson metrics are thus generalizations of 
tree metrics obtained by relaxing the equality conditions of the four-point 
theorem. The following theorem, proved in [6], gives the Kalmanson metric 
equivalent of the four-point condition. 

Theorem 2. A metric D satisfies the Kalmanson condition if and only if there 
exists a circular split system S and weight function w : S ^ R"*" such that 
D = y w{S)Ds. If it does, the decomposition is unique. 



Proof. Suppose D — w{S)Ds for some split system S that is compatible 

ses 

with respect to a circular ordering C ~ {xi, X2, . • . , Choose i < j < k < I 



(2) 




Sg5 
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and S — A\B e S. One can check that 

Ds{xi,Xk)+Dsixj,xi)-Dsixi,Xj)-Ds{xk,xi) = 

(3) 

so D satisfies the Kalmanson condition. 

Conversely, assume D is Kalmanson with respect to the circular ordering 
{xi, . . . , Xn}- Define 

2a{iJ) = D{xi,Xj) + D{xi_i,Xj_i) - D(a;j,a;j_i) - D{x^_i,Xj). 

The Kalmanson condition shows this is non-negative. 

Recall that Si,j := {xi, Xi+i, . . . , Xj^i}\{xj,Xj^i, . . . , Xi-i\. The system 
S = {'S'i.j}i<j is clearly circular. We claim 

D = Y,a{i,j)Ds^,,. (4) 

To see this, rewrite the right hand side of Q, expanding the aii, j) and group- 
ing together the coeflacients of each D{xi,Xj). This gives D — J2i<j D{xi,Xj)cij 
with 

2c,,, = i?5,, + ^5.+,,,+, - i?s,+,,, - . (5) 

Now Cij{xk, xi) — SikSji. This proves the correctness of Q and thus shows 
that D comes from a weighted circular split system. 

For a circular ordering C there are (2) splits in S and by ([5| the di- 
mension of metrics that are Kalmanson with respect to C is also (2) , so for a 
fixed circular ordering the weighting is unique. Now suppose D is Kalmanson 
with respect to two distinct circular orderings Ci , C2 . Let Si be the split sys- 
tem given by Ci, and consider the decomposition D = a(fc, l)Dst , with 

k<l 

respect to Ci. If Si,j is circular with respect to Ci but not with respect to C2, 
then without loss of generality there exists some fc, I with i < k < j < I such 
that {xi,Xk,Xj,xi} is cyclic with respect to Ci and {xi,Xj,Xk,xi} is cyclic 
with respect to C2 ■ This implies 

D{xi,x-j) + D{xk., xi) > D{xi,Xk) + D{xj,xi) > D{xi,Xj) + D{xk, xi), 

where the first inequality comes from the Kalmanson condition on Ci and the 
second comes from the Kalmanson condition on C2 ■ So we have equality, and 
by (§, 

= D{xi,Xj)+D{xk,xi)-D{xi,Xk)-D{xk,xi) ^2 ^ w{S) > w{Sij), 

S = A\BeS 

ikeA.jleB 

where the inequality follows since 5*^^- is in the summand. So a{i,j) = 
w{Sij) = 0, and the only nonzero terms in the decomposition of D with 
respect to Ci correspond to splits in Si which are also splits in This 



2 Xi,Xj e A,Xk,xi e B, 
otherwise. 
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shows the decomposition of D is unique, and thus the map ly from weighted 
circular spht systems to Kalmanson metrics given by 

iy{S, w){x, y) = ^ w{S)Ds{x, y) 
SeS 

is a bijection. □ 

Let ^ = i'^^ be the map that takes a Kahnanson metric to the weighted 
circular spht system that describes it, and let D be Kalmanson with £,{D) = 
{S, w). We want to find a PC-tree T such that /3(T) = S as this would provide 
a nice encapsulation of the "treeness" of our metric, but by Proposition|9]such 
a tree exists if and only if S is an unrooted family, which is not necessarily 
the case. There is, however, a canonical best choice. 

Theorem 3. Let D be a Kalmanson metric. There is a unique PC-tree T 
and weighting function w : f3{T) R>o such that the weighted circular split 
system (/3(T),iy) gives rise to D, and such that the number of zero weights 
\{S £ l3{T)\w{S) — 0}| is minimal. 

Proof. Define the closure map t : 5 — > 5, where S is constructed via the 
following algorithm: 
S^S_ 

while S contains a pair of incompatible splits Ai\Bi, A2\B2 do 

S ^ SU{AinA2\BiUB2,AinB2\A2UBi,A2nBi\AiUB2,BinB2\AiU 
A2} 

end while 

Since X is finite the above algorithm must terminate. By construction 5 is a 
circular split system and an unrooted family, and if S' is an unrooted family 
with S' 3 S, then we must also have S' 3 S. By Proposition [o] there is 
a unique PC-tree T with /3(T) = S. We have shown that if T' is another 
PC-tree with (3{T') 5 5, then /3(T') D /3(T), so in a well-defined sense T is 
the "best-fit" PC-tree for D. Let ^(-D) = (5, w) be the weighted circular split 
system corresponding to D and let w be a weighting on S given by extending 
w as 

'w{S) SeS, 
SeS-S. 



MS) 



Then ^{Sjw) ~ D and if is a weighted circular split system with 

^((5', w')) = D, then S' dS and w' onS,w' = on S' -S. □ 

We now explore how this construction looks on the affine side. 

Definition 16. Let _D be a metric on X and choose r E X. The Gromov 
product based at r is defined by 

20, (D) (x, y) = D{x, y) - D{x, r) - D{y, r) yx,yeX\ {r}. (6) 

The Gromov product is also known as the Farris transform |lll I15j in 
pliylogenetics. It is easy to check that the map 

MR)ix, y) = 2R{x, y) - R{x, x) ~ R{y, y). 
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satisfies ipr ° (f>r{D) = D and so is its inverse. 

Definition 17. A matrix R is Robinsonian over X if there exists a linear 
ordering ^ oi X such that 

'max{R{x, y),R{y, z)} < R(x, z) "ix ^ y ^ z. 

i? is a strong Robinsonian matrix if, in addition, for all w ^ x ^ y ^ z, 

R{x,y) = R{w,y) R{x, z) = R{w, z), (7) 

R{x,y) ^ R{x,z) =^ R{w,y) = R{w,z). (8) 

In [7] it is shown that if D is Kalmanson then (f>r{D) is a strong Robin- 
sonian matrix. Here, we give a slightly more precise characterization of the 
image. 

Lemma 3. Let D be a Kalmanson dissimilarity map and R ~ (f)r{D). Then 
R is a strong Robinsonian matrix with the following properties: 

1. R{x, y) < for all x,y £ X , 

2. For every w ^ x ^ y ^ z, 

R{x,y) + R{w,z) < R{x,z) + R{w,y). (9) 

Furthermore, 4>r is a bijection from Kalmanson dissimilarities to the 
space of these matrices. 

Proof. Suppose D is Kalmanson with respect to the order {xi, X2, . . ■ , Xn,r} 
and R — (j)r{D). It is immediate from the definition of the Gromov product 
([6| and the Kalmanson condition ([2| that 4>r{D) satisfies the above conditions 
with linear ordering xi ^ X2 < . . ■ < Xn- For w < x ^ y < z, 

2{R{x, z) - R{x, y)) = D{x, z) + D{y, r) - D{x, y) - D{z, r) > 0, 

so R{x,z) > R{x,y). Similarly R{x,z) > R{y,z), so R is Robinsonian. Now 
assume R{x,y) = R{x,z). Then (|9| gives R{w,z) < R{w,y), and since R 
is Robinsonian we also have the reverse inequality, so R{w,z) — R{w,y). 
Similarly if y) = R{w, y) then R{x, z) — R{w, z), so R is strong. 

Conversely, let i? be a strong Robinsonian matrix satisfying Q. Then 
ipr{R) clearly satisfies the Kalmanson conditions. Also, 

M^, V) = {R{x, y) - R{x, x)) + (Rix, y) ~ R{y, y)) > 0, 

and tpr{x,x) — for all x,y G X. So ^pr{R) is a Kalamanson dissimilarity. 
The maps (j)r and ipr are inverses, completing the proof. □ 

Therefore the image of consists of negative strong Robinsonian ma- 
trices satisfying a kind of four-point condition Q. 

Next we define the afSne analogue of weighted circular split systems. 

Definition 18. Let T' be a pyramid. A function / : T-" — > R is an indexing 
function if A C B =^ /(A) < f{B) for aU A,B e V. We cah (P,/) an 
indexed pyramid. 
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Definition 19. A subset A C X is maximally linked pL^ with respect to a 
Robinsonian matrix R if there exists d G R such that R{x, y) < d for all 
x,y G A, and A is maximal in this way. If A is such a set, define the diameter 
of A to be diam{A) — max^. j,^^ R{x, y). 

Let M (i?) denote the set of maximally linked sets with respect to Robin- 
sonian matrix R and define the function / : M.{R) — R by f{A) — diam{A). 

Proposition 10. The map t : R (M(R), f) is a bijection from Robinsonian 
matrices to indexed prepyramids, and from strong Robinsonian matrices to 
indexed pyramids. 

Proof. Suppose A e M.{R) for R Robinsonian and let a, 6 G A be the leftmost 
and rightmost points in A. Then for every a ^ x < y ^ b, R{x, y) < R{a, b), 
so diam{A) = M{a,b). This shows x G A for all a ^ a; ^ &, so every set 
in M{R) is an interval. Now suppose A,B£ M.{R) with A C B. Let A = 
[xi,yi],B = [x2,y2]- Then X2 di Xi di yi di 2/2, and 

f{A) = diam{A) = R{xi,yi) < R{x2,y2) = diam{B) = f{B), 

where the inequality follows from the fact that ^ is a maximally- linked set. 
So / is an index and r(i?) is an indexed prepyramid. 

Conversely, consider the map fi from indexed prepyramids to matrices 
given by 

fi{V,f){x,y)^ min f{A). 

Let R = niVJ). Given x d y d z, let E ^ {A e V\x,y € A}, F = {A € 
V\x, z e A}. Then F C E, so 

R{x,y) = min /(A) < min /(A) = R{x,z). 

Similarly i?(y, z) < R{x, z), so R is Robinsonian. It is easy to check that V 
consists precisely of the sets that are maximally linked with respect to R, so 
T and fj, are inverses. 

Now suppose i? is a strong Robinsonian matrix. We must show t{R) is 
closed under intersection. Let A = [ai,bi],B = [02,^2] be sets in V, suppose 
ai ^ 02 d bi ^ 62 and let C = AnB = [02, bi]. We will show C is a maximally 
linked set with diameter R{a2, 61). If x >- 61, the Robinsonian condition gives 
R{a2,x) > R{a2,bi). If there was equality then by the strong Robinsonian 
condition R{ai,x) = R{ai,bi) and x G A, a contradiction. Similarly, there 
is no a; ^ 02 with R{x,bi) = R{y,ai), so C G V and V is closed under 
intersection. 

Conversely, suppose {V, f) is an indexed pyramid and let R = /x((7-', /)). 
Because V is closed under intersection, for each A <Z X there is a unique 
A e V such that A C A,_ and A C B for &\\ A Q B G V. This follows 
immediately from taking A — PlAcsep ^- now, suppose w ^ x ^ y ^ 
z and R{x,y) = R{x,z). The set A :— {x,y} Ci {w,y} is in V since V is 
closed under intersection. f{{x,y}) — f{{x,z}) which implies z G {x,y}. 
Now x,y Cz A implies {x,y} € A, so z G A. But then z e {w,y} which 
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gives R{w,z) — R{w,y). A similar argument shows R{x,y) — R{w,y) 
R{x, z) = R(w, z), so i? is a strong Robinsonian matrix. □ 

Given two elements A, B Cz V we say i? is a predecessor oi A ii A C B 
and there does not exist C E V such that A C C C B. 

Lemma 4. Let V be a pyramid. Then each set in V has at most two prede- 



Proof. Suppose there is an A = [a, b] E V with three distinct predecessors 
Bi — [ai, bi], i — 1,2, 3. Because V is closed under intersection Bi O Bj = A 
so either a.i — a or hi — b. By the pigeonhole principle two of the BiS must 
share an endpoint, so assume ai = a2 = a. Then either Bi C B2 or B2 C Bi 
contradicting the fact that each Bi is a predecessor of A. □ 

For a set A £ V, let Pa = {-Pi} denote the predecessors of A. If {V, f) 
is an indexed pyramid, we define the map : 7^ — > R as the unique function 
satisfying 

\~f{A) if|PA|=0, 

wiA) = l-f{A) + f{P,) if|PA| = l, (10) 

[-/(A) + /(Pi) + /(P2) - fiPi U P2) if \Pa\ = 2. 
By Lemma |4] this is well-defined. 

Proposition 11. Let R be a negative strong Robinsonian matrix satisfying 
the Robinsonian four-point condition, and take t{R) = (V,f). Then f is 
negative, and w{A) > for all A G V. Furthermore, every such indexed 
pyramid lies in the image of t . 

Proof. Let P be a negative Robinsonian matrix satisfying ([9]). Clearly (10) 
holds for \Pa\ — because / is negative, and holds for \Pa\ — 1 because 
Pi A /(Pi) > f{A). So now assume A — [x,y] has two predecessors. 
By the argument in Lemma |4] these must be of the form Bi = [w,y] and 
B^ = [x, z] for some w ^ x < y ^ z, so 



w{A) - -f{[x, y]) + fi[w, y]) + f{[w, z]) - fi{w, z}) 
= —R{x, y) + R(w, y) + R(w, z) — R{w, z) 
>0, 

because R satisfies the Robinsonian four-point condition. This argument is 
reversible, so we see r really is a bijection. □ 



The requirement that w{A) > for A with two predecessors ( 10 ) is thus 
a kind of four-point property for pyramids, and we will refer to it as such 
later. 

Let rjr be the map sending [V, f) to the weighted circular split system 
[S,w') given by 5 = {5r{A)\A G V}, w'{5r[A)) = w{A). 

Proposition 12. If D is a Kalmanson metric, then v o rjr o t o cj)^ is the identity 
map. 
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Proof. Let D' = 1^0 TjrOTo 0^(1?) and for A&V, let Oa = {B e V\A C B) 
be the sets over A. A spht 5r{A) separates x,y £ X \ {r} ii x £ A or y e A 
but not both. So the spht pseudometric Ds^(^a){x, y) is 1 iff A e O^^j \ 0[x^y} 
or Ae 0{y} \ 0{2;,y}- Then 

D'{x,y)^ w{A)- Yl E '^'(^)- E "^(^)- (11) 

AeO{,} agO{,,„} '4eO{^} agO{,_„, 

Now Oa — Oa, so by an easy induction 

Y HB)^ E ^{B) = -f{A), 

BeOA BeOA 

and D'ix, y) = 2/ - f{{x}) - f[{y})- Since f{A) - dzam(A), by the 

definition of the Gromov transform, 

D'{x, y) = D{x, y) - D{x, r) - D{y, r) + D{x, r) + D{y, r) = D(x, y). 

To compute D' {x, r) we note a; and r are separated by a spht 5r{A) iff a; e yl, 
so 

D'{x,r)^ Y wiA) = -f{M)^-MD){x,x) = Dix,r). 



Let i? be a Robinsonian matrix over X. If the prepyramid V in t{R) is 
a rooted set family, Proposition [6] shows there exists a PQ-tree T such that 
a(T) — v. Unfortunately this is usually not the case, so we seek instead to 
find a "best fit" tree. Analogous to the projective case, we construct the rooted 
closure V oiV with the following algorithm. 

while V contains a pair of incompatible sets A, B do 

V ^ H\J{A\JB,AC^B,A\B,B\A) 
end while 

Let 9 be the closure map sending V to V. We then have the affine analog of 
Theorem [S] 

Lemma 5. The PQ-tree T with a{'T) = 9(V) is the unique tree with a{T) 3 V 
that minimizes |q!(7~)|. 

It remains to show that 9 commutes with the rest of the diagram. Let 
D be a Kalmanson metric, S the corresponding split system and V the as- 
sociated indexed pyramid. There is not necessarily a bijection between the 
intervals in V and the splits in S] this can be seen, for example, because V 
is closed under intersection while S need not be. The splits in V that are 
not in S will get assigned weight zero by the map 77^, which is why the lower 
rectangle commutes, but the maps 9 and l forget about the weights so it's not 
clear that 9o5r — Loip^. Fortunately, for pyramids that arise from Kalmanson 
metrics this bijection holds. 

Lemma 6. 5r ° 9 o t o (j)r{D) = l o ^(D) for all Kalmanson metrics D. 
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Proof. Let ("P,/) — to (j)r{D) and let the corresponding weighted 

spht system. Suppose A e V but Sr{A) ^ S, or equivalently f{A) — 0. 
A = [a, b] is an interval with respect to the Robinsonian metric. If c ;^ 6 then 
M(a,c) > M{a,b) because c ^ ^, so 

< A/(a,c) -M(a,6) = /(a,c) - /(a,6) = - ^ + ^ 

a.cSB a.beB 

The first sumniand is a subset of the second, so there exists B E V with 
a,b E B, c ^ B and u'(-B) > 0. Letting c be the smallest element with c )^ b 
shows there exists a set B — [x.b] £ V with w{B) > and x ^ a. Similarly 
there exists a set C = [a, y] e V with w{C) > and y>~b. So A^BnC 
for sets B,C G V that correspond to splits of positive weight in S, and thus 
6r{A) G io r]{S). This completes the proof. □ 

We are now ready to state our final result that summarizes the bijections 
described above. Let PC be the set of all PC-trees, CUF the set of all circular, 
unrooted split families, WCSS the set of all weighted circular split systems, 
and K the set of all Kalmanson metrics, all over X. Let PQ be the set of all 
PQ-trecs, FRF the set of pyramidal rooted families, IP the set of negative 
indexed pyramids satisfying the pyramidal four-point condition, and SR the 
set of negative strong Robinsonian matrices satisfying the Robinsonian four- 
point condition, all over X \ {r}. 

Theorem 4. The following diagram commutes: 
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This gives a way of constructing the best-fit PC-tree for a given Kalman- 
son metric D: 

7" = o aT^ o 9 o T o (j)j.{D). 
An example illustrating Theorem |4] is shown in Figure 6. The PC-tree 
in the upper right reveals the tree structure in the Kalmanson metric (see 
also Figure 2). 

As a final remark, we note that the geometry of Kalmanson metrics is 
also explored in [55] where the Kalmanson complex is described. PC-trees 
are faces in this complex, and it should be interesting to understand their 
combinatorics in the face lattice. 
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Figure 6. An example illustrating Theorem |4] 
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Table 1. Nomenclature and abbreviations. 

AT AfRne (rooted) X-trees with root r 

PT Projective (unrooted) X-trees 

H Hierarchies over X \ {r} 

PSS Pairwise compatible split systems over X 

U Ultramctrics 

TM Tree metrics 

PQ PQ-trees over X \ {r} 

PC PC-trccs over X 

PRF Pyramids that are rooted families over X \ {r} 

CUF Circular split systems that are unrooted families over X 

IP Negative indexed pyramids satisfying 

the pyramidal four-point condition over X \ {r} 

WCSS Weighted circular compatible split systems over X 

SR Negative strong Robinsonian matrices satisfying 

the Robinsonian four-point condition over X \ {r} 

K Kalmanson metrics over X 
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